Parsing named entity as syntactic structure
Abstract
Named entity recognition (NER) plays an important role in many natural language processing applications. This paper presents a novel approach to Chinese NER. It differs from most previous approaches in three main respects. First, while previous work is good at modeling features between observation elements, our model incorporates syntactic structure as higher-level information. This is crucial for recognizing long named entities, which are one of the main difficulties of NER. Second, NER and syntactic analysis have so far been modeled separately in natural language processing. We integrate them in a unified framework, which allows the information from each type of annotation to improve performance on the other and produces consistent output. Finally, few studies have been reported on the recognition of nested named entities in Chinese. This paper presents a structured prediction model for Chinese nested named entity recognition. Our approach has been implemented through a joint representation of syntactic and named entity structures. We provide empirical evidence that a parsing model can use syntactic constraints to recognize named entities and can exploit the composition patterns of named entities. Experimental results demonstrate mutual benefits for both tasks, and the model also outputs the syntactic structure of named entities.
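To make the idea of a joint representation concrete, here is a minimal Python sketch of a parse tree whose constituents can also carry an entity type, so that nested entities fall out of a single tree traversal. This is only an illustration under stated assumptions, not the paper's actual representation or model: the `Node` class, the `words` and `entities` helpers, the tag names (`NP`, `NR`, `NN`, `ORG`, `GPE`) and the example phrase are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """A constituent in a joint tree: syntactic label plus optional entity type."""
    label: str                      # syntactic category (hypothetical tag set)
    entity: Optional[str] = None    # entity type such as "ORG", or None
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None      # set only on leaf nodes


def words(node: Node) -> List[str]:
    """Collect the leaf words covered by a node, left to right."""
    if node.word is not None:
        return [node.word]
    out: List[str] = []
    for child in node.children:
        out.extend(words(child))
    return out


def entities(node: Node, start: int = 0) -> List[Tuple[int, int, str, str]]:
    """Return (start, end, type, text) for every entity-labeled node.

    Offsets count words; an outer entity is listed before the entities
    nested inside it.
    """
    found: List[Tuple[int, int, str, str]] = []
    span = words(node)
    if node.entity is not None:
        found.append((start, start + len(span), node.entity, "".join(span)))
    offset = start
    for child in node.children:
        found.extend(entities(child, offset))
        offset += len(words(child))
    return found


# Assumed example: a GPE ("北京") nested inside an ORG ("北京大学").
tree = Node("NP", children=[
    Node("NP", entity="ORG", children=[
        Node("NR", entity="GPE", word="北京"),
        Node("NN", word="大学"),
    ]),
    Node("NN", word="校长"),
])

print(entities(tree))
```

Because entity labels sit on tree nodes, entity spans in this sketch can never cross constituent boundaries, which is one simple way to read the "consistent output" property mentioned in the abstract.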
Related resources
A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Named Entity Recognition is an information extraction technique that identifies named entities in a text. Three methods have conventionally been used to extract named entities from a text: rule-based, machine-learning-based, and hybrids of the two. Machine-learning-based methods perform well on Persian if they are trained with good features. To get good performanc...
Corpus linguistics meets language technology:
To the extent that NLP is used by QA systems, it is mostly limited to tokenization, named entity recognition, stemming, POS tagging, and shallow parsing. More sophisticated NLP such as (deep) syntactic parsing is hardly ever used. In the present paper I investigate why this should be the case and try to establish how deep syntactic parsing as developed in the field of corpus linguistics might c...
MAIMAI: A Question Answering System at NTCIR3 QAC-1
This paper describes a question answering system based on syntactic information. Our system extracts answer candidates by ranking them with a score that reflects similarity of syntactic structure. Syntactic structure is estimated based on answer type, density of weighty words, distance between words and depth of parse tree. To analyze syntactic structure, morphological analysis, named entity extraction a...
Intertwining Deep Syntactic Processing and Named Entity Detection
In this paper, we present a robust incremental architecture for natural language processing centered around syntactic analysis but allowing at the same time the description of specialized modules, like named entity recognition. We show that the flexibility of our approach allows us to intertwine general and specific processing, which has a mutual improvement effect on their respective results: ...
Grammarless Parsing for Joint Inference
Many NLP tasks interact with syntax. The presence of a named entity span, for example, is often a clear indicator of a noun phrase in the parse tree, while a span in the syntax can help indicate the lack of a named entity in the spans that cross it. For these types of problems joint inference offers a better solution than a pipelined approach, and yet large joint models are rarely pursued. In t...
Publication date: 2014